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Abstract 



' This paper considers the setup of a parallel MIMO relay network in which K relays, each equipped 

m '. 

with antennas, assist the transmitter and the receiver, each equipped with M antennas, in the half- 



duplex mode, under the assumption that N > M. This setup has been studied in the literature like in 
[1], [2], and [3]. In this paper, a simple scheme, the so-called Incremental Cooperative Beamforming, is 
introduced and shown to achieve the capacity of the network in the asymptotic case of K oo with a 
gap no more than O (^j^^^^ j ■ This result is shown to hold, as long as the power of the relays scales as 
^ log^K) ^ pinally, the asymptotic SNR behavior is studied and it is proved that the proposed scheme 



lO ■ achieves the full multiplexing gain, regardless of the number of relays. 

m 
o 

^ ■ I. Introduction 

^ ■ A. Motivation 



In recent years, Multiple-input Multiple-output (MIMO) wireless systems have received signif- 
icant attention. It has been shown that MIMO wireless systems have the ability to simultaneously 
enhance the multiplexing gain (degrees of freedom) and the diversity (reliability) of the Rayleigh 
fading channel [4], [5], [6]. The relay channel, which was first introduced by Van-der Meulen in 
1971 [7], has been reconsidered in recent years to improve the coverage, reliability, and reduce 
the interference in the multi-user wireless networks. The main idea is to employ some extra 
nodes in the network to aid the transmitter/receiver in sending/receiving the signal to/from the 
other end. In this way, the supplementary nodes act as (spatially) distributed antennas assisting 
the signal transmission and reception. 

After some recent information-theoretic results on the MIMO point-to-point Rayleigh fading 
channels [4], [5], [6], there has been growing interest in studying the impact of MIMO systems 
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in more complex wireless networks. Some promising results have been published on MIMO 
Multiple-Access and Broadcast channels in [8], [9], [10], [11], and [12]. However, there are still 
only a few results known concerning the MIMO relay networks. Moreover, no capacity-achieving 
strategy is known for the Gaussian relay channel. 

This paper analyzes the performance of a parallel MIMO relay network. Our focus is on the 
Amplify and Forward (AF) strategy. Not only the AF strategy offers low complexity and delay, 
but also it performs well in our setup. 

B. History 

The classical relay channel was first introduced by Van-der Meulen in 1971 [7]. In [7], a node 
defined as the relay enhances the transmission of information between the transmitter and the 
receiver. The most important relevant results have been published by Cover and El Gamal [13]. 
In [13], two different coding strategies are introduced. In the first strategy, originally named 
"cooperation", and later known as "decode-and-forward" (DF), the relay decodes the transmitted 
message and cooperates with the transmitter to send the message in the next block. In the 
second strategy, known as "compress-and-forward" (CF), the relay compresses the received signal 
and sends it to the receiver. The performance of the DF strategy is limited by the quality of 
the transmitter-to-relay channel, while CF's performance is mostly restricted by the quality of 
the relay-to-receiver channel [13]. The drawback of using CF strategy is that it employs no 
cooperation between the transmitter and the relay at the receiver side. Hence, the CF strategy 
is unable to exploit the power boosting advantage due to the coherent addition of the signal of 
the transmitter and the relay [13]. 

More recently, several extensions of the relay channel have been considered, e.g. in [14]-[17]. 
Some of these extensions consider a multiple-relay scenario in which several nodes relay the 
message. The parallel relay channel is a special case of the multiple relay channel in which the 
relays transmit their data directly to the receiver. Besides studying the well-known "compress- 
and-forward" and "decode-and-forward" strategies, the authors in [14], [15] have also studied 
the "amplify-and-forward" strategy where the relays simply amplify and transmit their received 
data to the receiver. Despite its simplicity, the AF strategy achieves a good performance. In fact, 
[14] shows that AF outperforms other strategies in many scenarios. Moreover, [15] proves that 
AF achieves the capacity of the Gaussian (single antenna) parallel relay network as the number 
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of relays increases. 

References [1], [2] extend the work of [15] to the MIMO Rayleigh fading parallel relay 
network. Unlike the single antenna parallel relay scenario, in this case the AF multipliers are 
matrices rather than scalars. Hence, finding the optimum AF matrices becomes challenging. 
Reference [1] has proposed a coherent AF scheme, called "matched filtering", and proves that 
this scheme follows the capacity of the channel with a constant gap in terms of the number 
of relays in the asymptotic case of A' ^ oo. They also show that the achievable rate of AF 
in parallel MIMO relay network grows linearly with the number of antennas (reflecting the 
multiplexing gain) and grows logarithmically in terms of the number of relays (reflecting the 
distributed array gain [1]). 

Reference [3] presents a new AF scheme using the QR decomposition of the forward and 
backward channels in each relay that outperforms the other AF schemes for practical number 
of relays. 

C. Contributions and Relation to Previous Works 

In this paper, we consider the AF strategy in the parallel MIMO relay network. The channel 
is assumed to be Rayleigh fading and the communication takes place in the half-duplex mode 
(i.e. the relays can not transmit and receive simultaneously). We propose a new AF protocol 
called "Cooperative Beamforming Scheme" (CBS). Considering the uplink channel (from the 
transmitter to the relays) as a point-to-point channel, in CBS the relays cooperatively multiply 
the channel matrix with its left eigenvector matrix. Hence, the relays act like the spatially 
distributed antennas at the equivalent receiver. The interesting point is that to perform such 
an operation, each relay only needs to know its corresponding sub-matrix of the beamforming 
matrix. For the outputs to be coherently added at the receiver end, each relay has to apply zero 
forcing beamforming to its corresponding downlink channel (the channel from each relay to 
the receiver). Here, the interesting result is that the overall channel from the transmitter to the 
receiver becomes diagonal and the overall Gaussian noise has independent components. 

We show that the proposed scheme is optimum in the case of having negligible noise in the 
downlink channel. However, the downlink noise would degrade the system performance when 
one of the relays' downlink channels is ill-conditioned. To enhance the performance of CBS 
in general scenarios, this work introduces a variant of CBS called "Incremental Cooperative 
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Beamforming Scheme" (ICBS). In ICES, the relays with ill-conditioned downlink channels are 
turned off. This strategy improves the overall point-to-point channel from the transmitter to the 
receiver. However, an interference term due to turning some of the relays off will be included 
in the equivalent point-to-point channel. 

It is shown that for asymptotically large number of relays, one can simultaneously mitigate the 
downlink noise and the interference term due to the tumed-off relays. As a result, the achievable 
rate of ICBS converges to the capacity of parallel MIMO relay network with a gap which scales 



their scheme can asymptotically (K oo) achieve the capacity up to 0(1). Also, our numerical 
results show that the achievable rate of ICBS converges rapidly to the capacity, even for moderate 
number of relays. Our results also demonstrate that the achievable rate of ICBS, the maximum 
achievable rate of amplify and forward strategy, the capacity of the parallel MIMO relay network, 
and the point-to-point capacity of the uplink channel converge to each other for asymptotically 
large number of relays. 

We also show that the same result can be achieved by ICBS, as long as the power of the 
relays scales as u (^log^(ir)) Q. Finally, by analyzing the asymptotic SNR behavior of the 
proposed scheme, it is proved that, unlike the matched filtering scheme of Bcskei-Nabar-Oyman- 
Paulraj (BNOP) which results in a zero multiplexing gain, our proposed scheme achieves the 
full multiplexing gain, regardless of the number of relays. 

The rest of the paper is organized as follows. In section II, the system model is introduced. 
In section III, the proposed AF scheme is described. Section IV is dedicated to the asymptotic 
analysis of the proposed scheme. Simulation results are presented in section V. Finally, section 
VI concludes the paper. 

D. Notation 

Throughout the paper, the superscripts and * stand for matrix operations of transposition, 
conjugate transposition, and element-wise conjugation, respectively. Capital bold letters represent 
matrices, while lowercase bold letters and regular letters represent vectors and scalars, respec- 
tively. ||v|| denotes the norm of the vector v while ||A|| represents the frobenius norm of the 

' /(w) — Lj{g{n)) is equivalent to Iim„^oo — oo 




Stronger than the result of [1] and [2] in which they show that 
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matrix A. |A| denotes the determinant of the matrix A while ||A||^ represents the maximum 
absolute value among the entries of A. The notation A^^ stands for the pseudo inverse of the 
matrix A. The notation A ^ B is equivalent to B — A is a positive semi-definite matrix. For any 
functions /(n) and g{n), f{n) = 0{g{n)) is equivalent to lim^^oo < oo, f{n) = o{g{n)) is 



fin) 



gin) 



0, f{n) = ^{g{n)) is equivalent to lim^^o^ S > 0, f{n) > g{n 



equivalent to lim^. 

is equivalent to lim„^oo ^ > 1, f{n) = uj{g{n)) is equivalent to lim^^o^ = OO' /(^) ~ 
g{n) is equivalent to lim„^oo ^ = 1 and f{n) = Q{g{n)) is equivalent to lim„_oo = c, 
where < c < oo. 

IL System Model 

The system model, as in [1], [2], and [3], is a parallel MIMO relay network with two-hop 
relaying and half-dulplexing between the uplink and downlink channels. In other words, the data 
transmission is performed in two time slots; in the first time slot, the signal is transmitted from 
the transmitter to the relays, and in the second time slot, the relays transmit data to the receiver. 
Note that there is no direct link between the transmitter and the receiver in this model. The 
transmitter and the receiver are equipped with M antennas and each of the relays is equipped 
with antennas. Throughout the paper, we assume that > M. The channel between the 
transmitter and the relays and the channel between the relays and the receiver are assumed 
to be frequency flat block Rayleigh fading. The channel from the transmitter to the kth relay, 
1 < k < K, is modeled as 

rfc = HfcX + nfc, (1) 
and the downlink channel is modeled as 

K 

y = ^Gfctfc + z, (2) 

fe=i 

where the channel matrices and Gk are i.i.d. complex Gaussian matrices with zero mean 
and unit variance, ~ CJ\f{0, 1^) and z ~ CJ\f{0, 1m) are Additive White Gaussian Noise 
(AWGN) vectors, and are the A;th relay's received and transmitted signal, respectively, and 
X and y are the transmitter's and the receiver's signal, respectively. and are of the sizes 
NxM and MxN, respectively (figure [T])- 

The task of amplify and forward (AF) relaying is to find the matrix for each relay to be 
multiplied by its received signal to produce the relay's output as = Ffcr^. In this way, the 
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entire source-destination channel is modeled as 



(3) 



.fc=l / k=l 

In addition, the power constraints E[x^x] < Pg and Ex,nfc[t^tfc] < Pr must be satisfied for 
the transmitted signals of the transmitter and the relays, respectively. We assume Pr = Pg = P 
throughout the paper, except in Theorem |2l where we study the case Pr < Pg = P- 



M 
antennas 




First Hop 



Second Hop 



Relays with N received/transmitted antennas 



t 



• M 

• antennas 



Rx 



Fig. 1. A schematics of a parallel MIMO half-duplexing relay network 



III. Proposed Method 

A. Cooperative Beamforming Scheme 

The equivalent uplink channel can be represented as = [Hf |H|^|- ■ -IH^]^. By applying 
Singular Value Decomposition (SVD) to H, we have H = UA^V^. Therefore, the diagonal 
matrix A has at most M nonzero diagonal entries corresponding to the nonzero singular values 
of H. Consequently, we can rearrange the SVD such that U is of size NK x M while V and A 
are M X M matrices. U can be partitioned to M x sub-matrices as U = [Uf |U|^|- ■ -lU^]^ . 
Suppose the kih. relay multiplies its received signal by Uf^, then passes it through the zero-forcing 
matrix G\, and finally amplifies it with a constant scalar a independent of k\ equivalently, we 
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have Ffc = aG^U|^. At the receiver side, we have (figure [2l) 



K 

a ^ Gfctfc + z 

k=l 
K 

fc=i 
aU^r + z 



a 



(Hx + n) + z 



n, 



(4) 



where n 



[vlI\vlI 



I T 1 



[rf|r^|---|r^]^, and n„ = U^n ~ CM{Q,Im)- If the 



transmitter beamforms its data vector as x = Vx', the end-to-end channel becomes 



y = a ( A 2 X' 



+ z. 



(5) 



Equation © shows that the end-to-end channel is diagonal and the noise vector is white 
Gaussian. Note that the complexity of the decoder in such a channel is linear in terms of the 
number of transmitter's antennas, M, and also there is no interference among different data 
streams. In fact, the output signals of the relays not only do not interfere with each other, but 
also add constructively at the receiver side. Moreover, as it is shown in section IV, for a ^ oo, 
the achievable rate of such a scheme converges to the point-to-point capacity of the uplink 
channel which is shown to be an upper-bound on the capacity of the parallel relay system. 

The problem is that the value of a is dominated by 



a 



P 



nfe 



GtUfrfc 



(6) 



This guarantees that the output power of all relays is less than or equal to P. However, by 
applying Q, the value of a could be small in the cases where the downlink channel of any of the 
relays is ill conditioned. This means that while the output power of the worst relay (according 
to ^) is equal to the maximum possible value, i.e. P, there may be many relays with the output 
power far less than P. This phenomenon degrades the performance, as in this case the downlink 
noise, z, would be the dominant noise in ©. 
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y = a ( A5x' + n„) + z 



Fig. 2. Cooperative Beamforming Scheme 



B. Incremental Cooperative Beamforming Scheme (ICBS) 

As the number of relays increases, we expect (as shown in ^) to have smaller values of 
a with high probability. In other words, there is a higher chance of having at least one ill- 
conditioned downlink channel among the relays. In this case, we can select a subset of relays 
which are in good condition and turn off the rest. In this variant of CBS, we select a subset of 

2" 



we activate the 



relays which results in a high value of a. Defining /3k = Ex.nfe 
relays which satisfy (3k < P, where P is a predefined threshold. In this manner, it is guaranteed 
that a > ^J^. This improvement in the value of a is realized at the expense of turning off some 
of the relays, creating interference in the equivalent point-to-point channel. More precisely, by 
defining A = {k\(3k > (3}, have (figure [3]) 



a 



fAi-$:ufH„v)x' 

\ k£A J 



Uf n, I + z. 



(7) 



As (|7]) shows, by decreasing the value of (3, one can guarantee a large value of a while increasing 
the gap of the equivalent channel matrix to A^. It will be shown in the next section that for 
large number of relays, it is possible to guarantee both having a large value of a and a small 
deviation from A^. Moreover, we show that by appropriately choosing the value of (3, the rate 
of such a scheme would be at most O ( tzzht^ ) below the corresponding capacity. 



log(i^) 
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Fig. 3. Incremental Cooperative Beamforming Scheme 



C. A Note on CSI Assumption 

In the BNOP scheme, it is assumed that each relay knows its corresponding forward and 
backward channels, i.e. Ha; and Gk, and at the receiver side, the effective signal power and 
the effective interference plus noise power are known for each antenna. However, in CBS and 
ICES, it is assumed that the transmitter knows the uplink channel, i.e. Hi, ■ ■ ■ , H^', and sends 
the N X M matrix Ufc to the /c'th relay, k = 1, ■ ■ ■ , K. This assumption is reasonable when the 
uplink channel is slow-fading; for example, in the case that the transmitter and all the relay nodes 
are fixed. Furthermore, similar to the BNOP scheme, we assume that each relay knows its forward 
channel, i.e. Gfc. In addition, in CBS, it is assumed that the value of a is set by negotiating 
between the relays through sending their corresponding to the transmitter. This assumption is 
not required in ICBS, as the value of a can be set as a = ^J^, where (3 is a predefined threshold. 
Finally, in both CBS and ICBS, it is assumed that the receiver has the perfect knowledge about 
the equivalent point-to-point channel from the transmitter to the receiver. This information can 
be obtained through sending pilot signals by the transmitter, amplified and forwarded at the relay 
nodes in the same manner as the information signal. In CBS, as the equivalent point-to-point 
channel is diagonal, this assumption is equivalent to knowing the equivalent signal to noise ratio 
at each antenna. 
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IV. Asymptotic Analysis 

In this section, we consider the asymptotic behavior {K oo) of the achievable rate of 
ICBS. We show that by properly choosing the value of (3, the achievable rate of ICBS converges 
rapidly to the capacity (the difference approaches zero as O (y Xog(K) )^- sequence of proof is 
as follows. In Lemma 1, we relate P [ii > (^] (the probability that the norm of interference term 
defined in equation (|7]) exceeds a certain threshold) to ¥[k E A] (the probability of turning off 
a relay) and P[||Ufc|p > 7] (the probability of having a sub-matrix with a large norm in the 
unitary matrix obtained from the SVD of H). In Lemma 2, we bound P[||Ufc|p > 7]. In Lemma 
3, we bound F[k G As a result, in Lemma 4, we show that by properly choosing the value 
of P, with high probability, one can simultaneously reduce the effect of the interference to o{K) 
and maintain a large value of a. In Lemma 5, we show that with high probability, the minimum 
singular value of H scales as 0{K). Putting Lemmas 4 and 5 together, with high probability, the 
ratio of the power of interference to the power of signal approaches zero. Finally, in Theorem 1, 
we prove the main result by showing that the achievable rate of ICBS converges to the capacity 
of the uplink channel. This is proved using the fact that the capacity of the uplink channel is 
an upper-bound on the capacity of parallel MIMO relay network. As a consequence stated in 
corollary 1, the achievable rate of ICBS, the achievable rate of the AF protocol, the point-to- 
point capacity of the uplink channel, and the capacity of the parallel MIMO relay network are 
asymptotically equal. As another consequence, the difference of the rates scales as O(jj^^p^). 

Using the proof of Lemma 4 and Theorem 1, Theorem 2 shows that as long as the power 
of relays behaves as Pr{K) = a; log^ {K))^ the same rate is achievable by ICBS. Finally, in 
Theorem 3, we study the asymptotic SNR behavior of CBS and ICBS, and show that, unlike 
the matched filtering scheme of BNOP, CBS and its variant achieve the full multiplexing gain, 
regardless of the number of relays. 

Lemma 1 Consider a parallel MIMO relay network with K relays using ICBS. We have 

MNK'^ 

F[v>i] < -^{nBK]+inM). (8) 

where v is defined as v = U^^^HfeU^ , and and B/^ are indicator variables defined as 

Ak = {k E A) and Bk = (HUfcp > 7), respectively. 
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Proof: Let us define = [U^ |A; G A\ and = \ill\k e A\ .We have 

W E_[||U2H^P] 



W E[||U^|n|H^f] 



W E[||U^f||Hf] 
(J E[||U^f]E[||H| 
MA^KE[||U^f] 



(9) 



Here, Markov inequality is applied to derive inequality (a). (6) is obtained by applying the norm 
product inequality on matrice^. (c) results from the fact that ||H_4||^ < HHp. Finally, equation 
(d) follows from the fact that the left unitary matrix, i.e. U, resulted from the SVD of an i.i.d. 
complex Gaussian matrix, is independent of its singular value matrix, i.e. , [19], and the fact 
that II Hp is a function of A. 

To upper-bound E [||U^|p], we have 



E 



K 



k=l 



(a) 



KE [AfcllUfcf] 

= i^E[iiUfcinA,] 

= KE[\\lJ4'\Ak,Bk]nAk,Bk] 

+ KE[\\Vkf\Ak,Bl]F[Ak,Bl] 

< K{F[Ak,Bk] + -fF[Ak,B'^]) 

< K{F[Bk]+-fF[Ak]), 



(10) 



where (a) follows from the fact the channels are symmetric, (b) follows from the fact that the 
norm of is upper-bounded by 1 and conditioned on the event B^, it is upper-bounded by 7, 



^Assuming A and B two matrices of sizes mxn and n x k, correspondingly, we liave ||AB|p < ||A|p||B|p [18] . 
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and finally (c) follows from the basic probability inequalities. Combining inequalities Q and 
(flOl) completes the proof. ■ 



Lemma 2 Consider a KN x M Unitary matrix U, where its columns Uj, i = 1, ■ ■ ■ , M, are 
isotropically distributed unit vectors in C^^^^. Let W an arbitrary N x M sub-matrix of 
U. r/zen, /or a predefined value of M and N and assuming 7 = u; (-^), as K ^ 00, we have 

P [||wf > 7] = O [{K-fY''-'^ e-^^^) (11) 

Proof: See Appendix A. ■ 

Lemma 3 For a small enough value of S, we have 

y <P[Sfc]+CiV^ + C2e^^, (12) 



where S = ^, and ci , C2 and d are positive constant parameters independent of K, (3, and 7. 
Proof: Assume k'\h relay is off. Hence, we have 



< A-L(G,)||U,f (l + F||H,f). (13) 



Here, (a) follows from the product norm inequality of matrices and independency of the noise 
from other random variables in the system. Defining the events 

Ck = (^A^i„(Gfe)<i^(l + P||Hfef)^, (14) 
= (A,,in(Gfe)<<5(l + P||Hfc||2)), (15) 



we have 



nM < nck] 



< ¥[B,]+¥[Ck\Bl]¥[Bl] 



id) 

< F[Bk]+F[Dk], (16) 

where (a) results from (fT3l) . (b) and (d) follow from basic probability inequalities and (c) follows 
from the fact that conditioned on ||Ufc|p < 7, we have + P||Hfc|n < 5 (1 + P||Hfcf), 
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which incurs that Ck C D^. Defining as the submatrix defined on the first M rows of G^, 
we have 



< p 



(a) 
< 



(b) 
< 



ic) 



P 



P 



A„.in(Gfc)<V^]+P 
Amin(Wfc)<V^] +P 



P||H.f,-Lj 

1 ■ 



> 



T{MN) 



X 



P\Vs 



'MN-l 

E 

m=0 



< M^5 + MNe ^) 



(17) 



Here, (a) results from the union bound, (b) results from the fact that Amin (Gfc) > Amin (W^) 
which can be shown easily based on the definition of the singular values of a matrix, (c) 
results from applying the probability density function of the minimum singular value of square 
i.i.d. complex Gaussian matrix, derived in [20], and also the fact that HHfcp has Chi-Square 
distribution with 2MN degrees of freedom, and finally, (d) results from the assumption that 
6 is small enough such that Vm,0 < m < MN, we have - l)j < e^(^~^). By 

Combining the results of (fT6l) and (flTl) . we obtain (fT2l) and this completes the proof. ■ 
Next, we apply Lemmas 1, 2, and 3 to prove that for large values of K, by properly choosing 
the value of P, ICBS can simultaneously achieve a large value of a and reduce the interference 
to o(K), with a high probability. 



Lemma 4 By assigning (3 



\og{K) 



and 7 = ^iH^^, ICBS simultaneously achieves 



P 



V > 



K 



a = n (Vlog(A') 
log' (K) 



O 



K 



(18) 
(19) 



where v is defined in Lemma 1. 
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Proof: Having {3 = j^^^p^: the value of a would be 



P 



a 



(20) 



and this results in (fTSi) . Assuming ^ = ^^^^^^ , we have 



(a) 

F[v>^] < MNK \og^{K) ( P[5, 



(b) 

< MNK\og^{K) 



21og(i^) 
21og(A0 



ciW2 



log2(K) 



C2e 



"V 2 1og^(if) 



(21) 



(c) 

< 



2MNK\og^{K)W>[Bk] + O 



log^(K) 



K 



2MArinog'(K)0 (log(i^)) 



(e) ^^'log'W 



o 



\og\K) 



K 



K 



(22) 



Here, (a) follows from Lemma 1, [h] follows from Lemma 3, (c) follows the assumption that 
K is large enough such that 2 log(K) < K and d\/2j^^ > log(/'C) , (d) follows from Lemma 



2, and (e) follows from the fact that ^ > 2, which incurs that 



M 



K\og\K)0 ( (log(i^))(^~^) e-^'°sW 



O 



log^^^(if) 



\og\K) 



This completes the proof of Lemma 4. ■ 
Although with the threshold value stated by Lemma 4, the interference term may tend to infinity 
in terms of K, the signal term tends to infinity more rapidly. In fact, as the following Lemma 
shows, the singular values of the whole uplink channel matrix behave as 0{K) with probability 

1, as A' ^ oo. 

Lemma 5 Let A be an r x s matrix whose entries are i.i.d complex Gaussian random variables 
with zero mean and unit variance. Assume that r is fixed and s tends to infinity. Then, with 
probability one Amin(A) ~ s, or more precisely, 

log(s) \ \ . . „ / 1 



p 



A„,h,fA) ~s 1 + 



> 1-0 



s Vlog (s) I ' 



(23) 
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where Amin(A) denotes the minimum singular value of AA^. 



Proof: See Appendix B. 



Next, we prove the main theorem of this section. 

Theorem 1 By setting the threshold as j3 = iog{K) ' achievable rate of the proposed ICBS 
converges to the upper-bound capacity defined for the uplink channel. More precisely, 



pacity of the uplink channel and RjcBsiK) is the achievable rate of ICBS. 

Proof: By applying the cut-set bound theorem [21] on the broadcast uplink channel, it can 
be easily verified [1], [2] that the point-to-point capacity of the uplink channel, Cu{K), is an 
upper-bound on the capacity of the parallel MIMO relay network. Note that the factor ^ in the 
expression of Cu{K) is due to the half-duplex relaying. Define C„*(/i) = ^ log (l + ^^^f-)- 
We first show that Cu*{K) is an upper-bound for Cu{K), and then prove that a lower-bound for 
RicBs{K) converges to Cu*{K). 




Cu{K) 




max log(|li^^ + HQH^|) 
Tr{Q}<p 



(a) 




max log(|lM + H^HQ|) 
Q 

Tr{Q}<p 



(fe) 
< 




1 



max 
Q 

Tr{Q}<p 




{c) 

< 




max 
Q 

Tr{Q}<p 





(25) 
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Here, (a) follows from the matrix determinant equalitjO , (b) results from the fact that for any 
positive semidefinite matrix A, we have |A| < (^^^p^^ , (c) follows xom the generalization 
of the Cauchy-Schwarz inequality to the positive semidefinite matrice^ and (d) follows from 
the concavity of the logarithm function. Rephrasing ([7]), we have 



where 



n* = a^Ufnfc + z~CAr(0,P„0, 



(26) 

(27) 
(28) 



where Pn* = {J^keA" ^k^k) + Ia/- The achievable rate of such a system is 

1, 



RicBsiK) 



H 



> -Eh 

- 2 

(") 1 

> -Eh 

- 2 



log 
log 
log 



M' 



M 



a2 p 



M, fa 
ylog 



1 + a2 M 

2 \ I 



l + a^y+2^- 



H*H*^ 
log 



-H*H 



(29) 



where (a) follows from the fact that Pn* = {oP' + 1)Ia/ — i^keA^k^k) which results in 
Pn* ^ (a^ + 1)Ia/, or equivalently P^i ;^ ^^^2^Ia/. For convenience, let 

Rl{K) = ^Eh 

Since a is lower-bounded by the inverse of the threshold as a > ^J^, we have limi<-^oo y log (y-^^ 
0, or equivalently 

lim RicBs{K) - Rl{K) > 0. (30) 



log 




)1 




M 





^Assuming A and B to be M x N and N x M matrices respectively, we have + AB| — |Ijv + BA| [18]. 
''Assuming A and B to be positive semidefinite matrices respectively, we have Tr{AB} < Tr{A}Tr{B} [22]. 
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Define the events and as = ( \mn (H) > KN 



O 



4 1 log K 
K 



< 



K 



log^(A') 



. Consequently, we have 



> 1 - F[E]^] - ¥[F^ 
1 



> 1 



o 



o 



K 



O 



\og\K) 



K 



and Fi< 



(31) 



Here, (a) follows from union bound inequality and (6) follows from Lemmas 4 and 5. Assume 
the diagonal entries of A are ordered as Ai(H) > A2(H) > ■ ■ ■ > Aj\/(H). Thus, Rl{K) can be 
lower bounded as 
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Rl{K) > -F[EK,FK]En 



> 



> 



> 



= F[EK,FK]En 



log 
log 
log 



M 
P 



Ek, Fk 




P 
M 



A 2 - U^H^V 

¥ / A/ 

n ^ - 



Ek, Fk 



M 



i=l 



M-i 

iu2H^v|iin\'(H) 



Ek, Fk 



> F[EK,FK]En 



i=l 



log 

M 



P\^ 



F[Ek,Fk] Eh 

A/ 



j=i 

2^ 



M 



n^'(H) 

iVlog'(K) 1 + 



logK 



K 



Ek, Fk 



> ^[Ek,Fk] 
M 



M 



log 



P 
M 



M 



5^ Eh log(A,,(H)) 



i=l 



Ek, Fk 



N\og{K) 
M 



1 + 



id) 

> F[EK,FK]{f\o^ 



P 
M 



log(K) 



M 

+ —\og[KN 



(32) 



1 + 



4/logK 



K 



M 



N\og{K) 



1 + 



log 



I- 



M / 



log(ir) 
1 



(33) 



{g) M, fKNP\ 



log 



O 



log(ir) 

1 

log(/^) 

1 

log(AO 



1 + 



\og\K) 



K 



(34) 
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Here, (a) follows from an upper-bound on the determinant expansion of A2 - U;^H^V, 
expanded over all possible set entries between A and U^H^V, (b) follows from the fact that 
the Frobenius norm of a matrix is an upper-bound on the square of the maximum absolute 
value among its entries and also Vi : Aj(H) > Ainin(H), (c) follows from the fact that the 
expectation is derived conditioned on the events Ek and Fk, (d) holds due to the fact that 
conditioned on Ek, we have Aj (H) > KN 



1 + Of 



(e) follows from the fact that 



log (1 + ~ O (y^) ~ o (i^), (/) resuks from (EB, and finally, (g) 

follows from the fact that O (^^^) ~ o (y^)- Now, defining Rg (K) = flog (^), 
according to (|301 ) and (|34l) , we have 

lim RjcBs{K)-Rs{K)>0. (35) 

K — »oo 

Furthermore, we have: 

lim Cu*{K) - Rs{K) = 0. (36) 

if— >oo 

Comparing ([35]) and and observing the fact that Cu{K) > Cicbs{K), results in ^ 
and this completes the proof. ■ 

Corrolary 1 The capacity of parallel MIMO Relay network, the point-to-point capacity of the 
cut-set defined on the uplink channel, the achievable rate of amplify and forward relaying, and 
the achievable rate of ICBS, all converge to f log { ^^j^ ), cis K 00. 

Proof: Defining C{K), Cu{K), Raf{,K), and Rjcbs{K) as the capacity of parallel MIMO 
Relay network, the point-to-point capacity of the cut- set defined on the uplink channel, the 
achievable rate of the amplify and forward relaying, and the achievable rate of ICBS, respectively, 
it is clear that 

Ricbs{K)<Raf{K)<C{K)<Cu{K). (37) 
Relying on Theorem [H we know 

lim Cu{K) - Rs{K) = lim Ricbs{K) - Rs{K) = 0. (38) 

A"— >oo X— >oo 
^det (A) = J2tv aiiria2,r2- ■ -o-rnvn < J2tv W'i-TviO-2-K2- ' ■an-Kn\, where a is the parity function of permutation. 
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By observing that Raf{K) and C{K) are sandwiched between Rjcbs{K) and Cu{K), Sandwich 
theorem tells us that 

lim Raf{K) - Rs{K) = lim C{K) - Rs{K) = 0. (39) 

ii"— >oo ii"— >oo 



Corrolary 2 Achievable rate oflCBS is at most O {j^^^j^ below the upper-bound correspond- 
ing to the cut-set defined on the point-to-point uplink channel, i.e. Cu{K). 

Proof: Following the proof of Theorem [H we observe 

Cu{K) - RicBsiK) < ARi + Ai?2 + AR-,, (40) 

where ARi = ^ log (l + ^) results from the approximation of the first term in (|29l ). Ai?2 = 
'^{t^) <E4l), and finally, AR-^ = f\og{l + j^) ~ O {^) is the difference be- 
tween Cu*{K) and Rs{K). We know that a > = ^/Plog{K), and as a result, ARi = 
^ log ^1 + piog(j^) j ~ O f iojj^^-) j • Comparing the values of ARi, 1 < z < 3, we conclude that 
C^(K)-RjcBs{K) = ol^y m 
Apart from increasing the rate, using parallel relays also increases the reliability of the trans- 
mission. As the following corollary shows, the probability of outage when sending information 
at the rate O (^j^^^p^ j below the ergodic capacity approaches zero, as K oo. 



Corrolary 3 Consider the parallel MIMO relay network and ICBS with the threshold value 
P = — Ti7\- We have 



P 



- log 

2 ^ 



< a(Ao + O 



\og{K) 



O 



log^ (K) 



K 



Proof: Following the proof of Theorem 1, we observe this outage event is a subset of 
U whose probability is shown to be O 



Another interesting result is that by increasing the number of relays, each relay can operate 
with a much lower power as compared to the transmitter, while the scheme achieves the optimum 
rate. This shows another benefit of using many parallel relays in the network. 



Theorem 2 Up to the point that Pr{K) = a; log^ {K)), the achievable rate of ICBS satisfies 

fKNP\ 



M fKNP\ 

lim RicBs{K) - Cu{K) = lim Ricbs{K) - — log = 0. (41) 

A"— >oo K^oo Z 



February 1, 2008 DRAFT 



21 



Proof: We use the same steps as the proof of Lemma 4 with the same values of 7 and ^ 
Rewriting (|2TI) . we have 

21og(K) 



[v> i]< MNK\og\K) 



(42) 



where (5 = ^. In order that the second term in (l34l) (or equivalently Ai?,2 in (l40l) ) approaches zero, 
we must have P[i?i^, Fk] ~ l+o (^ iog(j^) j ' which implies that P[^; > .^1 ~ l+c" ^ iog( j<) j • From the 
above equation, it follows that having /9 ~ c<j ^ ^°g^^) j incurs that ^fd = ~ o ^y^^^^^ > 

or equivalently, v^log^(K) ~ {j^^^j^^, which results inP[i;>^]~l + o {j^^^jq^- Moreover, 
the first term in (|29l ) (or equivalently Ai?i in (1401) ) approaches zero, if Pr{K) = uj{/3) (or 
equivalently, a ~ c<;(l)). Therefore, having Pr{K) ~ u results in Ai?i, Ai?2 — > 0, 

which implies that limi^-^oo Cu{K) — Ricbs{K) = 0. ■ 

Theorem 3 The proposed Cooperative Beamforming scheme and its variant achieve the maxi- 
mum multiplexing gain of the relay channel. More precisely: 

«CBS(P) ^ M 
p-00 log(P) 2 ' ^ ^ 

and Y ^he maximum achievable multiplexing gain of the underlying half duplex system. ( Here 

RcBs{P) is the achievable rate of the proposed scheme for the given power constraint P.) 

Proof: We prove the theorem for CBS. The statements of the proof are also valid for the 
variant of CBS. First of all, from the last theorem, we have 

C.(P) < CAP) < y log (— ) = y log (— ) + y log(P) + -. (44) 

Here, (a) follows from the assumption that P is large enough such that we have P > Thus, 
the maximum achievable multiplexing gain is 

P^oo log(P) - 2 ^ ^ 

To prove the theorem, it is sufficient to show that the multiplexing gain of CBS is lower bounded 
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by To show this, we lower-bound the achievable rate of the scheme as follows: 



RCBS{P) 



-E 



G,H 



> 



-E 



G,H 



log 
log 



Im + 



a 



P 



a 



l + a^ M 



A 



l + a^ M 



MM MM 
> — log(P) + —En [log (A^in (H))] - — log(M) - — Eg,h 



log 1 + — 



W M, , , M^ 
> — log(P) + — 



x=0 



M 

e ^ log{x)dx — — log(M) 



M, 



-E, 



G,H 



log 1 + 



M, M2 M, M^ 

> — log(P) - — - Y log(M) - —Eg,h 



log U + 



(46) 



where (a) follows from the fact that Amin(H) > Amin(W), where W is an arbitrary M x M sub- 
matrix of H, noting that /A„,i„(w) (A) = Me'^'^, A > 0. Now, defining = Eg,h [log (l + ^)] , 
it is sufficient to show that Xa can be upper bounded by a finite expression independent of P. 
Defining x^^k = log [l + A^^J^ (Gfe) (||H^.f + ;^)], we have 



E, 



G,H 



log 



maxi<fc<i^ Ex,n., 



1 + 



/J 



E, 



G,H 



(a) 



max log 

l<k<K 



E. 



'x,nfc 



1 + 



P 



E, 



G,H 



max log I 1 + Xj^ (Gk 

l<k<K 



max Xa k 

l<k<K ' 



iH,r + - 



K 



k=l 



i^'EG,H [Xa,k] 



(47) 



Here, (a) results from matrix product norm inequality and independency of from and x, 
and (b) follows from the fact that Xa,k's are nonnegative i.i.d. random variables. Without loss of 
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generality, we can assume P is large enough such that P > 1. We can upper-bound E [xq, fc] as 

E [Xa,k] = EGfe,Hfc i log 



2 , 1 



< Eg„h, [log (1 + XmUGk) + l|H,f )] - Eg, [log (A^i„ (6^))] 

< Eg, [A^in(Gfc)] + Eh, [HH^f ] - Eg, [log (A^,, (G,.))] 

< N + MN-M / e-*^^log(x)dx 

< MN + M + N. (48) 

Here, (a) follows from the assumption that P > 1, (b) follows from the fact that log(l + x) < 
X, and (c) follows from the fact that E [Amin(Gfc)] < E 
Comparing (ggl), (HV]), and (gS]), we have 



M 



N, and also (a) in (gg 



As a result 



i?Ci?s(P)> Ylog(P) + 0(l). (49) 



. U,„ > |. (50) 



Comparing (1451) and (1501) completes the proof. ■ 
Remark - It is claimed in [1] that the proposed BNOP scheme achieves the full multiplexing 
gain of ^, for K ^ oo. However, it should be mentioned that this result is not valid for the 
asymptotically large values of SNR, for any fixed number of relays. Moreover, it can easily be 
shown that the interference term increases linearly with SNR, and as a result, the SINK term 
is limited by a constant value for large SNR values. Therefore, the multiplexing gain of BNOP 
scheme is zero for any fixed number of relays. 

V. Simulation Results 

Figure m shows the simulation results for the achievable rate of ICES, BNOP matched filtering 
scheme [1], and the upper-bound of the capacity based on the uplink Cut-Set for varying number 
of relays. The number of transmitting and receiving antennas in the relays, the transmitter, and 
the receiver is M = = 2, and the SNR is P^ = P, = 1MB. While both of the schemes 
demonstrate logarithmic scaling of rate in terms of K, we observe that there is a significant gap 
between the BNOP scheme and our scheme, reflecting the gap of 0(1) in the achievable rate 
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of [1]. On the other hand, the gap between ICBS and the upper-bound rapidly approaches zero 



due to the term O 



1 



log{A') 



predicted in Corollary 2. 



O 



BNOP scheme 
Upper Bound 
ICBS method 

\ 

90 



40 50 60 

Number of Relays 



Fig. 4. Upper-bound of the capacity, ICBS, and BNOP matched filtering Scheme vs. number of relays in parallel MIMO relay 
network 



VI. Conclusion 

A simple new scheme, Cooperative Beamforming Scheme (CBS), based on Amplify and 
Forward (AF) strategy is introduced in a parallel MIMO relay network. A variant of CBS, 
called Incremental Cooperative Beamforming Scheme (ICBS) is shown to achieve the capacity 
of parallel MIMO relay network for K ^ oo. The scheme is shown to rapidly approach the 



upper-bound of the capacity with a gap no more than O 



1 



. As a result, it is shown that 



1 



\og{K) 



in 



the capacity of a parallel MIMO relay network is C{K) = ^ log (l + ^^^) + O 
terms of the number of relays, K. Moreover, it is shown that as the number of relays increases, 
the relays in ICBS can operate using much less power without any performance degradation. 
Finally, the proposed scheme is shown to achieve the maximum multiplexing gain regardless of 
the number of relays. The simulation results confirm the validity of the theoretical arguments. 
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Appendix A 

Proof of Lemma 2 

Let us denote Wj as the ith column of W. In [23], it has been shown that 

T{NK) 



/|iw,||2(a^) 



z = l, 



,M, 



(51) 



T{N)T{NK - N)' 

which corresponds to the Beta distribution with parameters and NK — N. Therefore, we have 



M 



< P UaxllWill^ > ^ 



L i 

M 



i=l 



(a) 

< MP [J-,], 



(52) 



where (a) results from the Union bound on the probability, and jFj = ||Wjp > jj. Defining 



7' = and using (|5TI) . we obtain 



P[||Wf>7] < M(l-i^,w.|p(y)) 

T{NK) 



(a) 



M 
M 



T{N)T{NK - N) 

T{NK) 
T{N)T{NK - N) 

N-l 



1 



NK -N 



N 



E 

n=l 



n 



NK -N 
(N - j) 



+ 



MY. 



\{NK-N + 3] 



{NK 



^ {N -n)\{NK - N + n-l)\ 



y(Ar_„_i)^^ _ ^i-^NK-N+n 



N 



< 



MY, 



n=l 



(iVAY)^~"(l - 7) 
{N-n)\ 



NK~N 



(b) M{NKiY-\l-i) 

rsj 

(N-iy. 



/\NK-N 



1 + 



Ki 



(53) 



where (a) follows from the integration by part, and (h) follows from the fact that K'y' uj{\) 
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Appendix B 

Proof of Lemma 5 

The (^, j)th entry of AA^, denoted as [AA^jj ,,-, can be written as 

[AA ]jj = a^a^- , 

where a^ is the vector representing the ith row of AA^. Let us define B as 

B = [bf|---|b^ 



(54) 



(55) 



where bj 



a,; 



-, i = 1, 



, r. We have 



1 1 = ] 



(56) 



where = bjbj^ 

Lemma 3, as 



a,ja|^ 



I 1 1 1 I 



P — ^. The pdf of z{i,j) = |7(^,i)P has been computed in [23], 



Let us define C as the event that z{i,j) < for all i ^ j. Using (|57l) . we have 

P[C] = P 



(57) 



n 



> 1 - ^ 



l + 0(e- 



(58) 



where (a) results from the Union bound on the probability, noting that z{i,j) = z{j,i), 
Conditioned on C, the orthogonality defect of B, defined as rl'' V , can be written as 



BB 



1 



1 + 



(59) 
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where 5c (B) denotes the orthogonality defect of B, conditioned on C. Hence, using the fact that 
the orthogonality defect of A and B are equal, conditioned on C we can write 



A A 



HI 



1=1 



riii-rh«(75 

i=l '- * 



(60) 



where Ai(A)'s denote the singular values of AA^. Moreover, 

r 

^A.(A) = Tr{AA^} 



1=1 



E 



i=l 



Now, let us define events Vi as follows: 

Vi = - e) < ||aif < s(l + e)} , i = 1, 



(61) 



(62) 



where e 



21og(s) 



. Since ||aj|| = X]f=i where denotes the {i,j)th entry of A, and 



having the fact that |aijp are i.i.d. random variables with unit mean and unit variance, using 
Central Limit Theorem (CUT), ^||ajp approaches, in probability, to a Gaussian distribution with 

111 II 2 

unit mean and variance 7, as s tends to infinity. More precisely, defining X = ° and using 
Theorem 5.24 in [24], we have 



P 



21og(s) <X < V21og(s 



l-$fV21og(s) 



73 



exp 



73 



-$(-v/2Mi))exp ^^3^ 



+ 



(a) 



1 - 



1 + 




o 



S^/s 



(63) 



where $(.) denotes the CDF of the normal distribution, and and 73 denote the second and 
third moments of |aj^j p, respectively, (a) follows from the approximation of $(2:) for large x by 
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/2-KX 



-e 2 



and the fact that a ~ 73 ~ ©(I)- From the above equation, P[Pj] can be computed 



as 



P 



1 - e < - a,/ 
s 



O 



< 1 



(64) 



in which we have used the definition of e which is y ^^^7^- Conditioned on C and V, where 
V = Ol^^Vi, and using (l60l) and (|6TI) . we can write 

ni=i A. 



V 



n:=i Hi+o{e))] 



1+0 



0(e) 



O 



log(s) 



(65) 



where A = ^ Y7i=i Ai- Suppose that Amin = aX (a < 1). We have 



(a) aX[^{rX-aX)] 



r-l 



a(r — a 



A 



Vr-1 



(66) 



where (a) follows from the fact that knowing Amin, the product of the rest of the singular 
values is maximized when they are all equal. Hence, having the sum constraint of rX yields 
111=1 Aj < aA [;3y(^A — aX)Y ^ Using (l65l) . and noting that f{a) = is an increasing 

function of a over the interval [0,1], and writing the Taylor series of /(a) about 1, noting 

/'(I) = and r (1) 



we have 



a(r — a 



ir-l 



(r - 1 



\r—l 



0\ 



log(s) 



r(l — a) 
2(r - r 



O 



log(s) 



a ~ 1 + 01 



log(s) 



(67) 
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In other words, conditioned on C and V, it follows that Amin = A 
conditioned on V, we have A = s 1 + 



1 + 



p 



A. 



1 + 




. As a result, 

> P[C n p] 

i P[C]P[P] 

P[C] (P[P,])^' 

1581,1641 L _ 

~ 1 + 0(e 

1 + 01 



4/log(s) 



o 



. Moreover, 



(68) 



^s^log(s) 

where (a) follows from the fact that the norm and direction of a Gaussian vector are independent 
of each other, and as a result, C and V are independent. (6) follows from the fact that Pj's are 
independent and have the same probability. 
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